Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher.
Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?
Some links on this page may take you to non-federal websites. Their policies may differ from this site.
-
Abstract BackgroundPredicting phenotypes from genetic variation is foundational for fields as diverse as bioengineering and global change biology, highlighting the importance of efficient methods to predict gene functions. Linking genetic changes to phenotypic changes has been a goal of decades of experimental work, especially for some model gene families, including light-sensitive opsin proteins. Opsins can be expressed in vitro to measure light absorption parameters, including λmax—the wavelength of maximum absorbance—which strongly affects organismal phenotypes like color vision. Despite extensive research on opsins, the data remain dispersed, uncompiled, and often challenging to access, thereby precluding systematic and comprehensive analyses of the intricate relationships between genotype and phenotype. ResultsHere, we report a newly compiled database of all heterologously expressed opsin genes with λmax phenotypes that we call the Visual Physiology Opsin Database (VPOD). VPOD_1.0 contains 864 unique opsin genotypes and corresponding λmax phenotypes collected across all animals from 73 separate publications. We use VPOD data and deepBreaks to show regression-based machine learning (ML) models often reliably predict λmax, account for nonadditive effects of mutations on function, and identify functionally critical amino acid sites. ConclusionThe ability to reliably predict functions from gene sequences alone using ML will allow robust exploration of molecular-evolutionary patterns governing phenotype, will inform functional and evolutionary connections to an organism’s ecological niche, and may be used more broadly for de novo protein design. Together, our database, phenotype predictions, and model comparisons lay the groundwork for future research applicable to families of genes with quantifiable and comparable phenotypes.more » « less
-
ABSTRACT Evolutionary convergence provides natural opportunities to investigate how, when, and why novel traits evolve. Many convergent traits are complex, highlighting the importance of explicitly considering convergence at different levels of biological organization, or ‘multi‐level convergent evolution’. To investigate multi‐level convergent evolution, we propose a holistic and hierarchical framework that emphasizes breaking down traits into several functional modules. We begin by identifying long‐standing questions on the origins of complexity and the diverse evolutionary processes underlying phenotypic convergence to discuss how they can be addressed by examining convergent systems. We argue that bioluminescence, a complex trait that evolved dozens of times through either novel mechanisms or conserved toolkits, is particularly well suited for these studies. We present an updated estimate of at least 94 independent origins of bioluminescence across the tree of life, which we calculated by reviewing and summarizing all estimates of independent origins. Then, we use our framework to review the biology, chemistry, and evolution of bioluminescence, and for each biological level identify questions that arise from our systematic review. We focus on luminous organisms that use the shared luciferin substrates coelenterazine or vargulin to produce light because these organisms convergently evolved bioluminescent proteins that use the same luciferins to produce bioluminescence. Evolutionary convergence does not necessarily extend across biological levels, as exemplified by cases of conservation and disparity in biological functions, organs, cells, and molecules associated with bioluminescence systems. Investigating differences across bioluminescent organisms will address fundamental questions on predictability and contingency in convergent evolution. Lastly, we highlight unexplored areas of bioluminescence research and advances in sequencing and chemical techniques useful for developing bioluminescence as a model system for studying multi‐level convergent evolution.more » « less
An official website of the United States government
